Journal: Nature
Article Title: Bat genomes illuminate adaptations to viral tolerance and disease resistance
doi: 10.1038/s41586-024-08471-0
Figure Lengend Snippet: a , PacBio HiFi sequencing improves genome assembly contiguity. UCSC genome browser screenshot showing a 700 kb locus of the human chr20 and genome alignments to six rhinolophid bats (boxes in the alignment net represent aligning sequence and connecting lines deletions or unaligning sequence). The region highlighted in grey does not align between human and Rhinolophus ferrumequinum , because the R. ferrumequinum PacBio CLR-based assembly has a large 421,369 bp assembly gap in this locus. As a result, several genes contained in this locus are missing from this assembly, which contributes to the slightly higher number of missing genes for R. ferrumequinum vs. other rhinolophid bats (Fig. ). Consistent with PacBio HiFi-based assemblies often having higher contig N50 and N90 values (Fig. ), the other four HiFi-based rhinolophid assemblies have a contiguous sequence in this locus without an assembly gap. Rhinolophus sinicus , which was assembled from Illumina short reads, has 24 smaller assembly gaps with sizes 15–1058 bp in this locus. b-f , Lower assembly quality leads to missed signals of gene selection. To explore the effect of assembly quality on results from genome-wide selection screens, we replaced the high-quality Bat1K assembly (HLrhiFer5) of the Greater horseshoe bat ( Rhinolophus ferrumequinum ) with a previous short read based assembly (rhiFer1) of the same species that has a higher degree of incompleteness and fragmentation (panel b-d). We kept all other 114 mammalian species and tested if the genes under selection in the R. ferrumequinum branch with the HLrhiFer5 assembly are also under selection when using the rhiFer1 assembly. Similarly, we replaced the Bat1K assembly (HLrouAeg4) of the Egyptian rousette ( Rousettus aegyptiacus ) with a previous assembly (Raegyp2.0) that used long and short read data but has a smaller contig N50 and an excess of inactivating mutations (indicative of a higher base error rate; see Supplementary Fig. ). We identified 272 vs. 133 genes under positive selection in HLrhiFer5 vs. rhiFer1, and 299 vs. 194 genes under positive selection in HLrouAeg4 vs. Raegyp2.0, indicating that lower assembly quality hampers the identification of selected genes. To illustrate this, the panels show UCSC genome browser screenshots of five immune-related genes, where we detected positive selection in the Bat1K but not the previous assembly. The first three examples (b-d) show cases where the previous assembly does not cover the gene on a single scaffold and exons are missing because of assembly gaps. The last two examples (e-f) show how assembly problems other than assembly gaps hamper selection screens. b , LAT2 (linker for activation of T cells family member 2), a regulator of T cell activation , is split across two different scaffolds in rhiFer1, as shown by the two alignment chains between human (hg38 assembly) and R. ferrumequinum rhiFer1. Importantly, while other methods can capture only one of these gene fragments at best, TOGA recognizes both alignment chains as orthologous and joins both gene fragments, resulting in a more complete codon alignment. Nevertheless, coding exon 8 (blue highlight) is missing in rhiFer1, thus this exon is missing in the codon alignment. In HLrhiFer5, all exons align to a single scaffold. c , PPP6C (protein phosphatase 6 catalytic subunit), a factor that regulates STING phosphorylation and activation , is split across two scaffolds (alignment chains) in rhiFer1. While TOGA recognizes the red chain as an orthologous fragment of PPP6C , the inset shows that coding exon 1 overlaps an assembly gap in rhiFer1 (but not HLrhiFer5), thus this exon will be missed in the codon alignment. d , The gene locus of FOXP3 (forkhead box P3), a master regulator involved in regulatory T-cell development and function , is split across several scaffolds in rhiFer1. Although the entire coding region is present on a single scaffold (brown chain), the last coding exon overlaps an assembly gap in rhiFer1 and thus will be missed in the codon alignment. e , CD48 (CD48 antigen), a cell surface factor involved in adhesion and activation of adaptive immune cells , lacks an aligning exon 1 in the Raegyp2.0 assembly. Compared to the HLrouAeg4 assembly, Raegyp2.0 lacks ~22,800 bp of sequence and this sequence is also present in other Pteropodid assemblies. This indicates that this ‘deletion’ is likely an assembly error in Raegyp2.0. f , MX1 (MX dynamin like GTPase 1), an interferon-induced antiviral gene , has two orthologous alignment chains that cover the gene. While this apparent ‘duplication’ is likely due to incomplete haplotype purging in Raegyp2.0, it leads TOGA to classify MX1 as a 1:2 ortholog in this assembly and since our screen only considers 1:1 orthologs, this gene is missed in a screen including Raegyp2.0.
Article Snippet: For ISGylation detection, protein samples were mixed with 4× NuPAG LDS sample buffer (Invitrogen, NP0007), separated by NuPAGE 4–12% Bis-Tris gels in running buffer (Invitrogen, NP0001) for 70 min under 120 V, and transferred (Invitrogen, NP000061) for 90 min under 100 V. The following antibodies were used for detection: rabbit anti-MX1 polyclonal antibody (clone N2C2, Genetex, GTX110256, dilution 1:1,000); rabbit anti-ISG15 polyclonal antibody (middle region, Aviva Systems Biology, ARP59386_P050, dilution 1:1,000); rabbit anti-GAPDH monoclonal antibody (clone 14C10, Cell Signaling, 2118, dilution 1:2,000); rabbit anti-CD13 polyclonal antibody (Sino Biological, 10051-T60, dilution 1:2,000); rabbit HCoV-229E nucleocapsid polyclonal antibody (Sino Biological, 40640-T62, dilution 1:2,000); mouse anti-MYC monoclonal antibody (Sino Biological, 100029-MM08, dilution 1:2,000 for cell lysate and 1:1,000 for cell supernatants); rabbit anti-UBE1L monoclonal antibody (Huabio, HA721228, dilution:1:500); rabbit polyclonal anti-UBE2L6 antibody (Abclonal, A13670 ); rabbit polyclonal anti-HERC5 antibody (Abclonal, A14889 ); and HRP-conjugated goat anti-rabbit IgG (Transgen Biotech, HS101-01, dilution 1:5,000).
Techniques: Sequencing, Selection, Genome Wide, Activation Assay, Phospho-proteomics